Improving the credibility of unreliable information through static images and data mining: an experimental study to identify floods

Authors

  • Sidgley Camargo de Andrade
  • João Porto de Albuquerque
  • Alexandre C. B. Delbem
Abstract

Locations affected by flash floods are rich in information for flood management. Usually, several types and sources of information are available, and they can be related to one another to make decision-making more reliable. However, a major challenge is establishing the reliability of this information within such heterogeneous or complex datasets. For instance, reports of Volunteered Geographic Information (VGI) submitted through a crowdsourcing-based platform can be confirmed by means of images available on site. Thus, we carried out an experiment to identify the water level of a river by clustering static images with an evolutionary method of hierarchical data clustering called DAta-MIning COde REpositories (DAMICORE) (Sanches, Cardoso, and Delbem, 2011). Our experiment aimed to answer the following question: Is DAMICORE able to find matching clusters between static images gathered from the in-situ sensor and the water levels provided by the non-automatic interpretation mechanisms in the riverbed? These mechanisms (Figure 1: (a) water level ruler, (b) puppet, and (c) multi-color band) refer to the hazard index used in hydrology and help volunteers submit reports to the crowdsourcing-based platform (Degrossi, Albuquerque, Fava, and Mendiondo, 2014). Our dataset contains 288 images categorized as undefined (gray, 124), acceptable (orange, 109), high (red, 17), very high (dark red, 6), and flood (blue, 32). The images were obtained from the in-situ sensor at a 5-minute temporal resolution on November 23rd, 2015, when a flash flood occurred at 4 p.m. (Figure 1 (d)) in São Carlos, São Paulo, Brazil. Our preliminary results show a possible match between the clusters found (Figure 2) and the interpretation mechanisms of the water level in the riverbed. Therefore, there is evidence that DAMICORE can support VGI reports collected from dedicated platforms, improving the credibility of the information. Nevertheless, further experiments should be performed with a greater number of images per category and with matching between other types of VGI and authoritative data, e.g. social media and sensors.
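
The abstract does not describe how the match between clusters and water-level categories was measured, so the following is only a minimal sketch under stated assumptions, not the DAMICORE implementation. It assumes a compression-based distance between raw image bytes (the normalized compression distance, here computed with zlib) as a stand-in similarity measure, and it assumes cluster purity as one possible way to score agreement between cluster assignments and the categories reported above. The function names `ncd` and `cluster_purity` are hypothetical; the category counts are taken from the abstract.

```python
import zlib
from collections import Counter

# Category counts as reported in the abstract (288 images in total).
CATEGORY_COUNTS = {
    "undefined": 124,
    "acceptable": 109,
    "high": 17,
    "very high": 6,
    "flood": 32,
}


def ncd(x: bytes, y: bytes) -> float:
    """Normalized compression distance between two byte strings.

    Assumption: zlib is used as the compressor; the study may have
    used a different compressor or a different distance altogether.
    """
    cx = len(zlib.compress(x))
    cy = len(zlib.compress(y))
    cxy = len(zlib.compress(x + y))
    return (cxy - min(cx, cy)) / max(cx, cy)


def cluster_purity(cluster_ids, labels) -> float:
    """Fraction of items whose label is the majority label of their cluster.

    Assumption: purity is only one illustrative agreement score; the
    abstract does not say which measure, if any, was applied.
    """
    by_cluster = {}
    for cluster_id, label in zip(cluster_ids, labels):
        by_cluster.setdefault(cluster_id, Counter())[label] += 1
    matched = sum(c.most_common(1)[0][1] for c in by_cluster.values())
    return matched / len(labels)


if __name__ == "__main__":
    # Toy example with made-up cluster assignments, not data from the study.
    toy_clusters = [0, 0, 1, 1, 1]
    toy_labels = ["flood", "flood", "acceptable", "acceptable", "high"]
    print(sum(CATEGORY_COUNTS.values()))
    print(cluster_purity(toy_clusters, toy_labels))
```

In practice, `ncd` would be applied to every pair of image files to build a distance matrix, a hierarchical clustering method would group the images, and a score such as `cluster_purity` would then compare the resulting clusters with the five water-level categories.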

Similar resources

Improving security of double random phase encoding with chaos theory using fractal images

This study presents a new method based on the combination of cryptography and information hiding methods. Firstly, the image is encoded by the Double Random Phase Encoding (DRPE) technique. The real and imaginary parts of the encoded image are subsequently embedded into an enlarged normalized host image. DRPE demands two random phase mask keys to decode the decrypted image at the destination. T...

Super-resolution of Defocus Blurred Images

Super-resolution is a process that combines information from some low-resolution images in order to produce an image with higher resolution. In most of the previous related work, the blurriness that is associated with low resolution images is assumed to be due to the integral effect of the acquisition device’s image sensor. However, in practice there are other sources of blurriness as well, inc...

A method to solve the problem of missing data, outlier data and noisy data in order to improve the performance of human and information interaction

Abstract Purpose: Errors in data collection and failure to pay attention to data that are noisy in the collection process for any reason cause problems in data-based analysis and, as a result, wrong decision-making. Therefore, solving the problem of missing or noisy data before processing and analysis is of vital importance in analytical systems. The purpose of this paper is to provide a metho...

Application of Satellite Data and Data Mining Algorithms in Estimating Coverage Percent (Case study: Nadoushan Rangelands, Ardakan Plain, Yazd, Iran)

Assessing and monitoring rangelands in arid regions are important and essential tasks in order to manage the desired regions. Nowadays, satellite images are used as an approximately economical and fast way to study the vegetation in a variety of scales. This research aims to estimate the coverage percent using the digital data given by ETM+ Landsat satellite. In late May and early Ju...

A Proposed Model to Identify Factors Affecting Asthma using Data Mining

Introduction: The identification of asthma risk factors plays an important role in the prevention of the asthma as well as reducing the severity of symptoms. Nowadays, the identification process can be performed using modern techniques. Data mining is one of the techniques which has many applications in the fields of diagnosis, prediction, and treatment. This study aimed to identify the effecti...

Publication date: 2016